Abstract. We investigate the probability distributions of the recurrence intervals $\tau$ between
consecutive 1-min returns above a positive threshold $q > 0$ or below a negative threshold
$q < 0$ of two indices and 20 individual stocks in China's stock market. The distributions of
recurrence intervals for positive and negative thresholds are symmetric, and display power-law
tails tested by three goodness-of-fit measures including the Kolmogorov-Smirnov (KS)
statistic, the weighted KS statistic and the Cramer-von Mises criterion. Both long-term and
short-term memory effects are observed in the recurrence intervals for positive and negative
thresholds $q$. We further apply the recurrence interval analysis to the risk estimation for the
Chinese stock markets based on the probability $W_q(\Delta t|t)$, Value-at-Risk (VaR) analysis and
VaR analysis conditioned on preceding recurrence intervals.
1. Introduction

The recurrence interval analysis has attracted growing interest in extreme value statistics,
and has been extensively applied to a variety of experimental time series, such as records
of climate [1], seismic activities and turbulence [4]. In recent years, a great deal of
financial data has been compiled, which makes it possible to carry out relatively accurate
analysis of extreme events in stock markets. By investigating the recurrence intervals between
extreme events in stock markets, we can better understand the statistical properties of these
extreme events, which is of great importance for risk assessment. The recurrence
intervals between volatilities exceeding a certain threshold $q$ have been carefully studied, and
numerous phenomena have been unveiled.

On the other hand, only a few papers have been devoted to the study of recurrence
intervals between large price returns [17, 18, 19, 20]. It has been verified that the long-term
correlation of the time series has a remarkable influence on the recurrence interval
distribution. Numerical simulation studies have shown that for linear long-term correlated
time series the recurrence interval distribution follows a stretched exponential [2, 21, 22, 23].
Santhanam et al. subsequently provided an analytical expression for the recurrence interval
distribution in linear long-term correlated time series, and claimed that the distribution is
a combination of stretched exponential and power-law forms [24]. Recently, Bogachev and
Bunde studied the recurrence interval distribution for artificial multifractal signals, which
have non-linear correlations, and observed power-law behavior whose exponent varies with the
threshold value [18, 25]. The multifractal structure has proven to be the main feature of
financial records, and consequently the recurrence interval distribution should have power-law
behavior. Power-law behavior with exponent depending on the threshold value was indeed
observed in a variety of representative financial records.

Since the stock market crash in May 2007, the Chinese stock markets have
experienced a strongly fluctuating period. The increasing number of extreme events with
large price returns offers an opportunity to better understand the statistical properties of the
recurrence intervals. We investigate the recurrence intervals between price returns above a
threshold $q > 0$ or below a threshold $q < 0$ using high-frequency data from recent years, and
find that the tails of the recurrence interval distributions obey power-law scaling, independent
of the threshold value. This scaling behavior is consistent with some empirical results on
volatility recurrence intervals. The present recurrence interval analysis mainly focuses
on the probability distribution and memory effect of the recurrence intervals. Another purpose
of this paper is to perform risk estimation for the Chinese stock markets based on the recurrence
interval analysis. We note that previous empirical analyses of the recurrence of financial returns
were carried out using daily data [17, 18, 19, 20]. In this work, we investigate the recurrence
intervals of 1-min high-frequency returns to gain better statistics.

The paper is organized as follows. In Section 2, we explain the database analyzed. 
Section 3 studies the probability distribution of recurrence intervals using the Kolmogorov- 
Smirnov (KS) and Cramer-von Mises (CvM) tests. Section 4 further studies the memory 
effects of recurrence intervals. In Section 5, we attempt to perform the risk estimation based 
on the recurrence interval analysis. Section 6 concludes. 

2. Data sets 

Our analysis is based on a database of China's stock market retrieved from GTA Information
Technology Co., Ltd (http://www.gtadata.com/). We study the 1-min intraday data of 20 liquid
stocks actively traded on the Shanghai Stock Exchange and the Shenzhen Stock Exchange
from January 2000 to May 2009, together with the Shanghai Stock Exchange Composite Index
(SSEC) and the Shenzhen Stock Exchange Composite Index (SZCI) from January 2003 to April 2009.
Since the sampling time is 1 minute, the number of data points is about 340000 for each of
the two Chinese indices and about 500000 for each individual stock. These 20 stocks are
actively traded and represent a variety of industry sectors. Each stock is identified by a
unique 6-digit stock code. A stock whose code begins with 600 is traded on the Shanghai
Stock Exchange, while a stock whose code begins with 000 is listed on the Shenzhen Stock
Exchange.

3. Probability distribution of recurrence interval of price returns 

We study the recurrence intervals between extreme events with large positive or
negative price returns. The price return is defined as the logarithmic difference between two
consecutive minutes, that is,

$$R(t) = \ln Y(t) - \ln Y(t-1), \qquad (1)$$

where $Y(t)$ is the minute price at time $t$. Before we calculate the recurrence intervals, we
normalize the price return of each stock by dividing by its standard deviation,

$$r(t) = \frac{R(t)}{\left[\langle R(t)^2\rangle - \langle R(t)\rangle^2\right]^{1/2}}. \qquad (2)$$

We investigate the interval $\tau = \tau_q$ between two consecutive price returns above a positive
threshold $q > 0$ or below a negative threshold $q < 0$, as illustrated in figure 1, and then
study the statistics of these recurrence intervals between consecutive large price increases or
decreases.
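As a minimal sketch of the two steps just described (normalizing the returns and extracting the recurrence intervals), the following Python functions may be useful. The names `normalized_returns` and `recurrence_intervals` are illustrative and do not come from the paper, and intervals are measured in units of the sampling step; any treatment of overnight gaps is left out.

```python
import math

def normalized_returns(prices):
    """Log returns R(t) = ln Y(t) - ln Y(t-1), divided by their standard deviation."""
    R = [math.log(b) - math.log(a) for a, b in zip(prices, prices[1:])]
    mean = sum(R) / len(R)
    var = sum(x * x for x in R) / len(R) - mean * mean   # <R^2> - <R>^2
    sigma = math.sqrt(var)
    return [x / sigma for x in R]

def recurrence_intervals(r, q):
    """Intervals between consecutive returns beyond the threshold q.

    For q > 0 an event is r(t) > q; for q < 0 an event is r(t) < q.
    """
    if q > 0:
        hits = [t for t, x in enumerate(r) if x > q]
    else:
        hits = [t for t, x in enumerate(r) if x < q]
    return [b - a for a, b in zip(hits, hits[1:])]
```

For example, `recurrence_intervals(normalized_returns(prices), -2)` would give the intervals between consecutive normalized returns below $q = -2$.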



Figure 1. Illustration of recurrence intervals of the normalized returns of the SSEC index
(thresholds shown: $q = 1$ and $q = -1$).



3.1. PDF of recurrence interval of price returns 

We first calculate the empirical probability distribution function (PDF) $P_q(\tau)$ of the
recurrence intervals of price returns. In figure 2 the scaled PDFs $P_q(\tau)\langle\tau\rangle$ are plotted
as a function of the scaled recurrence intervals $\tau/\langle\tau\rangle$ for both positive and negative
thresholds for the two Chinese indices and four representative stocks, where $\langle\tau\rangle$ is the mean
recurrence interval. For both the Chinese indices and individual stocks, $P_q(\tau)\langle\tau\rangle$ for
$q > 0$ shows a profile very similar to that for $q < 0$ with the same threshold magnitude, e.g.
$q = 2$ and $q = -2$. This indicates that the recurrence interval distributions for positive and
negative thresholds are symmetric.

We find that, for small $\tau/\langle\tau\rangle$, the scaled PDFs for different $q$ values differ from each
other, especially for the two Chinese indices, which is partly due to the discreteness effect. In
contrast, the tails of the scaled PDFs collapse onto a single curve, displaying scaling
behavior. In other words, $P_q(\tau)\langle\tau\rangle$ depends only on $\tau/\langle\tau\rangle$,

$$P_q(\tau) = \frac{1}{\langle\tau\rangle} f(\tau/\langle\tau\rangle), \qquad (3)$$

and does not depend on the threshold $q$ when $\tau/\langle\tau\rangle$ is large enough.
Figure 2. Empirical probability distributions of scaled recurrence intervals for different
thresholds $q = -5, -4, -3, -2, 2, 3, 4, 5$ for (a) SSEC, (b) SZCI, and (c)-(e) four representative
stocks 600009, 600060, 000002 and 000063. The solid curves are the fitted functions $cx^{-\delta}$
with parameters listed in Table 1.



3.2. Fitting the scaling function of recurrence interval PDFs 

We then focus on the tail of the recurrence interval PDF and study the particular form of the
scaling function. The curves in figure 2 suggest that the tails of the PDFs may follow a
power-law form. We assume that the empirical PDF above $x_{\min}$ obeys the scaling form of Eq. (3)
and that the scaling function follows

$$f(\tau/\langle\tau\rangle) = f(x) = c x^{-\delta}, \quad x > x_{\min}, \qquad (4)$$

where $x_{\min}$ is the lower bound of the power-law distribution. Since our hypothesis is that
the empirical PDFs for different $q$ values above $x_{\min}$ coincide with their common best
power-law fit, we aggregate the interval samples for different $q$ values above $x_{\min}$ and fit
them using a common power-law function.

To estimate the parameters of this power-law distribution accurately, we use
the method proposed by Clauset, Shalizi and Newman, which combines maximum likelihood
estimation with the Kolmogorov-Smirnov (KS) statistic [26]. Let $F_q$ be the cumulative
distribution function (CDF) of the empirical data and $F_{\rm PL}$ the CDF of the power-law fit.
The KS statistic is defined as

$$KS = \max_{x \geq x_{\min}} \left(|F_q - F_{\rm PL}|\right). \qquad (5)$$

Following the simple idea of making the empirical PDF and the best power-law fit
as similar as possible, the estimate of $x_{\min}$ is determined by minimizing the KS statistic. The
parameters $c$ and $\delta$ are estimated using the maximum likelihood method. The fitted power-law
lines are illustrated in figure 2, with the estimated parameters $x_{\min}$, $\delta$, $c$ and the
resultant KS statistic listed in Table 1. Among the 20 individual stocks, 16 stocks have
$x_{\min} \leq 10$, showing a scaling region larger than one order of magnitude, and their power-law
exponents are estimated to be $3.0 \pm 0.3$. The two Chinese indices have scaling regions of more
than two orders of magnitude with power-law exponents $2.2 \pm 0.1$.
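The fitting procedure above can be sketched as follows, assuming the continuous-variable maximum likelihood estimator $\delta = 1 + n / \sum_i \ln(x_i/x_{\min})$ and a brute-force scan over candidate $x_{\min}$ values. The function names and the `min_tail` cut-off (a guard against fitting very small tails) are assumptions made for this illustration, and the prefactor $c$ is not estimated here.

```python
import math

def fit_tail(x_sorted, xmin):
    """Continuous power-law MLE for the tail x >= xmin; returns (delta, KS) or None."""
    tail = [x for x in x_sorted if x >= xmin]
    n = len(tail)
    log_sum = sum(math.log(x / xmin) for x in tail)
    if n < 2 or log_sum == 0.0:
        return None
    delta = 1.0 + n / log_sum
    ks = 0.0
    for i, x in enumerate(tail):
        model = 1.0 - (x / xmin) ** (1.0 - delta)   # power-law CDF on the tail
        # compare with the empirical step function just before and after x
        ks = max(ks, abs(model - i / n), abs(model - (i + 1) / n))
    return delta, ks

def fit_power_law(samples, min_tail=50):
    """Scan candidate xmin values and keep the one minimising the KS statistic."""
    xs = sorted(samples)
    best = None
    for xmin in sorted(set(xs)):
        if sum(1 for x in xs if x >= xmin) < min_tail:
            break
        res = fit_tail(xs, xmin)
        if res is not None and (best is None or res[1] < best[2]):
            best = (xmin, res[0], res[1])
    return best  # (xmin, delta, KS), or None if no tail was long enough
```

Here `samples` would be the aggregated scaled intervals $\tau/\langle\tau\rangle$ for the different thresholds.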

Table 1. KS, $KS_W$ and CvM tests of the recurrence interval distributions for different $q$
values, comparing empirical data with the best fitted distribution and synthetic data with the
best fitted distribution.
We have shown how to fit the empirical recurrence interval PDFs and estimate
the parameters. It is also necessary to test how well the power law fits the
empirical PDF. We therefore perform a goodness-of-fit test using the KS statistic. In doing so,
the bootstrapping approach is adopted [26, 27]. We first generate 1000 synthetic samples
from the power-law distribution that best fits the empirical distribution, and then reconstruct
the cumulative distribution $F_{\rm sim}$ of each simulated sample and the CDF $F_{\rm sim,PL}$ of its
best power-law fit. We calculate the KS statistic for the synthetic data from

$$KS_{\rm sim} = \max \left(|F_{\rm sim} - F_{\rm sim,PL}|\right). \qquad (6)$$

This KS statistic is relatively insensitive at the edges of the two cumulative distributions. To
avoid this problem we also use a weighted KS statistic defined as [27]

$$KS_{W,{\rm sim}} = \max \left( \frac{|F_{\rm sim} - F_{\rm sim,PL}|}{\sqrt{F_{\rm sim,PL}\,(1 - F_{\rm sim,PL})}} \right), \qquad (7)$$

which is uniformly sensitive across the whole range. The $p$-value is determined by
the frequency with which $KS_{\rm sim} > KS$ or $KS_{W,{\rm sim}} > KS_W$, where $KS_W$ is the weighted
statistic for the empirical data. The tests are carried out for the two Chinese indices and the
20 individual stocks, and the resultant $p$-values are listed in Table 1.
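A rough sketch of this bootstrap test is given below. It assumes that $x_{\min}$ is held fixed while only the exponent is refitted for each synthetic sample (a simplification of the full procedure), and that synthetic data are drawn by inverse-transform sampling; all function names are hypothetical.

```python
import math
import random

def mle_delta(tail, xmin):
    """Continuous maximum likelihood estimate of the power-law exponent."""
    return 1.0 + len(tail) / sum(math.log(x / xmin) for x in tail)

def ks_stats(tail, xmin, delta):
    """Return (KS, weighted KS) between the empirical and power-law CDFs."""
    xs = sorted(tail)
    n = len(xs)
    ks = ksw = 0.0
    for i, x in enumerate(xs):
        model = 1.0 - (x / xmin) ** (1.0 - delta)
        diff = max(abs(model - i / n), abs(model - (i + 1) / n))
        ks = max(ks, diff)
        if 0.0 < model < 1.0:   # the weight of Eq. (7) is undefined at 0 and 1
            ksw = max(ksw, diff / math.sqrt(model * (1.0 - model)))
    return ks, ksw

def bootstrap_pvalues(tail, xmin, n_boot=1000, seed=0):
    """Fraction of synthetic samples whose statistic exceeds the empirical one."""
    rng = random.Random(seed)
    delta = mle_delta(tail, xmin)
    ks, ksw = ks_stats(tail, xmin, delta)
    exceed_ks = exceed_ksw = 0
    for _ in range(n_boot):
        # inverse-transform sampling from the fitted power law
        sim = [xmin * (1.0 - rng.random()) ** (-1.0 / (delta - 1.0)) for _ in tail]
        ks_sim, ksw_sim = ks_stats(sim, xmin, mle_delta(sim, xmin))
        exceed_ks += ks_sim > ks
        exceed_ksw += ksw_sim > ksw
    return exceed_ks / n_boot, exceed_ksw / n_boot
```

The two returned numbers play the role of the $p$-values for the KS and weighted KS statistics, respectively.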

A $p$-value close to 1 indicates that the empirical PDFs for different $q$ values agree
with their common power-law fit as well as the synthetic data generated from the power-law
fit do, whereas a relatively small $p$ suggests that the empirical PDFs cannot be well described
by their common power-law fit. We adopt a significance level of 1%. If the $p$-value of
an individual stock is less than 1%, then the null hypothesis that the empirical PDFs of this
stock can be well fitted by their common power-law fit is rejected. As shown in Table 1, the
null hypothesis is rejected for four stocks (600000, 600058, 600100, 600104) using the KS
statistic and for SSEC using the $KS_W$ statistic. Since 16 stocks (out of the 20
stocks analyzed) pass the test using both the KS and $KS_W$ statistics, we conclude that for
most of the stocks the tails of the recurrence interval PDFs obey scaling behavior and the
scaling function can be well fitted by a power law.

To further test the goodness of this power-law fit, we use another goodness-of-fit measure
based on the Cramer-von Mises (CvM) statistic

$$W^2 = N \int_{-\infty}^{\infty} (F - F_{\rm PL})^2 \, dF_{\rm PL}, \qquad (8)$$

where $F$ is the CDF of the empirical data, $F_{\rm PL}$ is the CDF of the power-law fit, and $N$ is
the total number of scaled interval samples $x = \tau/\langle\tau\rangle$ [28, 29, 30]. For a sequence of
scaled interval samples $x_1, x_2, \cdots, x_N$ arranged in ascending order, $W^2$ can be calculated
from

$$W^2 = \frac{1}{12N} + \sum_{i=1}^{N} \left( F_{\rm PL}(x_i) - \frac{2i-1}{2N} \right)^2. \qquad (9)$$

At the significance level of 1% and for $N \gg 1$, if the $W^2$ statistic is greater than the
critical value 0.743 (see the critical values for different significance levels in Refs. [28, 29]),
the hypothesis that the empirical PDFs coincide with their common power-law fit is rejected.
The CvM tests are carried out for the two Chinese indices and the 20 stocks, and the
corresponding values of the statistic $W^2$ are listed in the last column of Table 1. Two stocks
(600000, 600100) show $W^2$ greater than the critical value and thus fail the test, so the tails of
their empirical PDFs cannot be approximated by the power-law distribution.
18 stocks (out of the 20 stocks analyzed) and the two Chinese indices pass the CvM test, which
further confirms our result that for most of the stocks the tails of the recurrence interval PDFs
can be well fitted by the power-law distribution. The CvM test provides results similar to
the KS test, and in general the $W^2$ statistic is smaller when the $p$-value of a stock is larger.
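Eq. (9) translates almost directly into code. The sketch below assumes the power-law CDF $F_{\rm PL}(x) = 1 - (x/x_{\min})^{1-\delta}$ on the tail; the function name `cvm_statistic` is illustrative only.

```python
def cvm_statistic(tail, xmin, delta):
    """Cramer-von Mises statistic W^2 of Eq. (9) against a power-law CDF."""
    xs = sorted(tail)
    n = len(xs)
    w2 = 1.0 / (12.0 * n)
    for i, x in enumerate(xs, start=1):
        f_pl = 1.0 - (x / xmin) ** (1.0 - delta)
        w2 += (f_pl - (2.0 * i - 1.0) / (2.0 * n)) ** 2
    return w2
```

A returned value above the critical value quoted in the text (0.743 at the 1% level) would correspond to a rejection of the power-law hypothesis.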
Figure 3. Scaled conditional PDF $P_q(\tau|\tau_0)\langle\tau\rangle$ of scaled recurrence intervals
$\tau/\langle\tau\rangle$ with $\tau_0$ in the largest 1/4 subset (open symbols) and smallest 1/4 subset
(filled symbols) for (a) SSEC, (b) SZCI, and (c)-(e) four representative stocks 600009, 600060,
000002 and 000063. The solid lines are guides for the eye.



4. Memory effect in recurrence interval of price returns 

To fully understand the statistical properties of the recurrence intervals of price returns, we
further investigate the temporal correlation between them. Empirical studies have revealed
that there exists a memory effect in the volatility recurrence intervals of various stock markets
[11, 13, 15]. In contrast, to the best of our knowledge, the memory effects of the
recurrence intervals of financial returns have not been investigated. Indeed, we
do observe memory effects in the recurrence intervals between large positive and negative
returns in the Chinese stock markets. Since the recurrence intervals between
positive returns behave very similarly to the recurrence intervals between negative
returns, we mainly show the results for the recurrence intervals between consecutive price
returns below negative thresholds $q < 0$.



4.1. Conditional PDF 

We first investigate the short-term memory by calculating the conditional PDF $P_q(\tau|\tau_0)$ of
recurrence intervals. $P_q(\tau|\tau_0)$ is defined as the probability of finding an interval $\tau$
conditioned on the preceding interval $\tau_0$. We study the conditional PDF for a bin of $\tau_0$ in
order to get better statistics. The entire interval sequence is arranged in ascending order and
partitioned into four bins of equal size. Figure 3 plots the scaled conditional PDF
$P_q(\tau|\tau_0)\langle\tau\rangle$ of the two Chinese indices and four representative stocks as a function
of the scaled recurrence intervals $\tau/\langle\tau\rangle$ for $\tau_0$ in the smallest and biggest bins,
marked with filled and open symbols, respectively.
The symbols for all negative thresholds for $\tau_0$ in the smallest and biggest bins approximately
collapse onto two separate solid curves as shown in figure 3. This further supports the
scaling behavior of the recurrence interval PDFs. Moreover, for large $\tau/\langle\tau\rangle$,
$P_q(\tau|\tau_0)$ with large $\tau_0$ is larger than that with small $\tau_0$, and for small
$\tau/\langle\tau\rangle$, $P_q(\tau|\tau_0)$ with small $\tau_0$ is larger than that with large $\tau_0$. Thus
large (small) preceding intervals $\tau_0$ tend to be followed by large (small) intervals $\tau$, which
indicates that there exists a short-term memory in the recurrence intervals of the Chinese
stock markets.
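The binning step can be sketched as follows: each interval is paired with its predecessor, the pairs are ordered by the preceding interval $\tau_0$, and the intervals following the smallest and largest quarter of $\tau_0$ values are returned for separate histogramming. The helper name `split_by_preceding` is an assumption made for this illustration.

```python
def split_by_preceding(intervals):
    """Return the intervals tau that follow the smallest and the largest
    quarter of preceding intervals tau0."""
    pairs = sorted(zip(intervals, intervals[1:]), key=lambda p: p[0])  # (tau0, tau)
    n = len(pairs)
    small = [tau for _, tau in pairs[:n // 4]]
    large = [tau for _, tau in pairs[3 * n // 4:]]
    return small, large
```

If short-term memory is present, the `large` subset should contain systematically longer intervals than the `small` subset.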



4.2. Detrended fluctuation analysis 

To investigate the long-term memory of the recurrence intervals, we adopt the detrended
fluctuation analysis (DFA) method [31, 32, 33, 34, 35], a general method for
examining long-term correlations in time series. The DFA method computes the
detrended fluctuation $F(l)$ of the time series within a window of $l$ points after removing a
linear trend. For long-term power-law correlated time series, $F(l)$ is expected to scale as a
power law with respect to the time scale $l$,

$$F(l) \sim l^{\alpha}, \qquad (10)$$

where the DFA scaling exponent $\alpha$ is expected to be equal to the Hurst exponent when
$\alpha < 1$ [36]: if $0.5 < \alpha < 1$ the time series is long-term correlated, and if $\alpha = 0.5$ the
time series is uncorrelated [31, 32].
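For readers who want to experiment, a compact first-order DFA can be written as below. This is a sketch under simple assumptions (non-overlapping windows starting from the beginning of the series, linear detrending only), and the function names are hypothetical rather than taken from the paper.

```python
import math

def linear_fit(xs, ys):
    """Least-squares slope and intercept of ys against xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx

def dfa_fluctuation(series, l):
    """Root-mean-square fluctuation F(l) of the profile after linear detrending."""
    mean = sum(series) / len(series)
    profile, acc = [], 0.0
    for v in series:                      # cumulative sum of the centred series
        acc += v - mean
        profile.append(acc)
    t = list(range(l))
    sq_sum, count = 0.0, 0
    for start in range(0, len(profile) - l + 1, l):   # non-overlapping windows
        seg = profile[start:start + l]
        slope, icpt = linear_fit(t, seg)
        sq_sum += sum((y - (slope * x + icpt)) ** 2 for x, y in zip(t, seg))
        count += l
    return math.sqrt(sq_sum / count)

def dfa_exponent(series, scales):
    """Slope of log F(l) against log l, i.e. the exponent alpha of Eq. (10)."""
    logs_l = [math.log(l) for l in scales]
    logs_f = [math.log(dfa_fluctuation(series, l)) for l in scales]
    return linear_fit(logs_l, logs_f)[0]
```

Applied to a recurrence interval sequence, `dfa_exponent(intervals, [8, 16, 32, 64, 128])` gives an estimate of $\alpha$; the choice of scales here is arbitrary.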

Figure 4 plots the detrended fluctuation $F(l)$ of the recurrence intervals for different
values of the negative threshold $q$ for the two Chinese indices. The curves show linear behavior
in the double logarithmic plot, which indicates that $F(l)$ obeys the scaling form of Eq. (10). The
DFA exponent $\alpha$ is estimated by fitting the slope of the curves in figure 4. We repeat the same
calculation to obtain the DFA scaling exponents $\alpha$ for the 20 individual stocks, and plot the
exponents in the right panel of figure 4. The exponent $\alpha$ tends to decrease as the
negative threshold $q$ decreases, but its values remain clearly larger than 0.5. This suggests
that the recurrence intervals are long-term correlated.
Figure 4. Detrended fluctuation $F(l)$ of recurrence intervals for (a) SSEC and (b) SZCI. The
curves are vertically shifted for clarity. (c) Exponents $\alpha$ of negative returns (black circles),
recurrence intervals (open symbols) and shuffled recurrence intervals (filled symbols) for the two
Chinese indices and 20 stocks.



Empirical studies have shown that the long-term memory of the recurrence intervals may
arise from the long-term memory of the original time series. To verify
this, we calculate $F(l)$ of the negative return series, retaining the positions of the positive
returns but ignoring their contribution to the calculation of $F(l)$. As shown in figure 4,
the exponents $\alpha$ of the negative return series for all the 20 stocks and the two Chinese
indices (represented by black circles) are significantly larger than 0.5, and therefore the
negative return series are long-term correlated. We then calculate the exponent $\alpha$ of the
recurrence intervals of the shuffled price returns, which are artificially uncorrelated. The
exponents $\alpha$ for all the individual stocks and the two Chinese indices are very close
to 0.5, and the recurrence intervals of the shuffled data are consequently uncorrelated. This
observation provides direct evidence supporting our assumption that the long-term memory of
the recurrence intervals may be due to the long-term memory of the price returns (either positive
returns or negative returns).



5. Risk estimation 



The study of recurrence intervals between extreme events in stock markets has drawn much
attention from scientists and economists. Most studies focus on investigating the statistical
properties of recurrence interval sequences and understanding their underlying dynamics. Not
much work has been done on applying recurrence interval analysis to risk estimation for real
stock markets. In what follows we attempt some risk estimation for the Chinese stock
markets based on recurrence interval analysis, following Refs. [17, 18, 19, 20].



5.1. Probability $W_q(\Delta t|t)$

In risk estimation, a quantity of great importance is the probability $W_q(\Delta t|t)$ that an
extreme event with price return below $q < 0$ occurs within a short time interval
$\Delta t \ll \langle\tau\rangle$, conditioned on an elapsed time $t$ after the occurrence of the previous
extreme event [18]:

$$W_q(\Delta t|t) = \frac{\int_t^{t+\Delta t} P_q(\tau)\,d\tau}{\int_t^{\infty} P_q(\tau)\,d\tau}. \qquad (11)$$

The analysis in Section 3 has shown that $P_q(\tau)$ obeys a power law as in Eq. (4) for
$\tau/\langle\tau\rangle > x_{\min}$. When $t > x_{\min}\langle\tau\rangle$, it can be derived algebraically that

$$W_q(\Delta t|t) = (\delta - 1)\,\Delta t/t. \qquad (12)$$

$W_q(\Delta t|t)$ is thus proportional to $\Delta t$ and inversely proportional to $t$. Figure 5 plots
$W_q(\Delta t = 10|t)$ for the two Chinese indices and four representative stocks, and
Eq. (12) fits the probability $W_q(\Delta t|t)$ well for $t > x_{\min}\langle\tau\rangle$. It is worth pointing
out that $W_q(\Delta t|t)$ shows an increasing tendency for extremely large $t$ because of the poor
statistics of rare events with large recurrence intervals. Measurements for other values of
$\Delta t$ show qualitatively similar results. An intriguing feature is that $W_q(\Delta t|t)$ is
independent of the threshold $q$, which is a direct consequence of the scaling behavior of
$P_q(\tau)$ shown in Eq. (3).
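To make the comparison between Eqs. (11) and (12) concrete, the sketch below estimates $W_q(\Delta t|t)$ directly from a list of recurrence intervals and evaluates the power-law prediction. For a pure power-law tail the ratio of integrals in Eq. (11) is $1 - (1 + \Delta t/t)^{1-\delta}$, which reduces to Eq. (12) when $\Delta t \ll t$. The function names are illustrative.

```python
def w_empirical(intervals, t, dt):
    """Fraction of intervals longer than t that end within (t, t + dt]."""
    longer = [tau for tau in intervals if tau > t]
    if not longer:
        return float("nan")
    return sum(1 for tau in longer if tau <= t + dt) / len(longer)

def w_power_law(delta, t, dt):
    """Small-dt form of Eq. (12): (delta - 1) * dt / t."""
    return (delta - 1.0) * dt / t

def w_power_law_exact(delta, t, dt):
    """Ratio of integrals in Eq. (11) for a pure power-law tail."""
    return 1.0 - ((t + dt) / t) ** (1.0 - delta)
```

With $\delta = 3$, $t = 100$ and $\Delta t = 10$, for instance, `w_power_law` returns 0.2.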



Figure 5. Comparison of $W_q(\Delta t = 10|t)$ estimated empirically with that obtained from Eq. (12)
for (a) SSEC, (b) SZCI, and (c)-(e) four representative stocks 600009, 600060, 000002 and
000063. The curves for $q = -2.5$ and $q = -3$ have been shifted vertically by factors of 0.1 and
0.01 for clarity. (Horizontal axis: $t$.)



5.2. Loss probability $p^*$

It is well known that the intraday returns $r_{\Delta t}$ of stocks and indexes are distributed
according to a Student distribution [40], whose tails follow the inverse cubic law [41]. Hence,
the tail obeys a power law

$$p(r) = k|r|^{-(\beta+1)}, \qquad (13)$$

where the exponent $\beta$ increases with increasing $\Delta t$. When $\Delta t$ equals one minute, the
inverse cubic law holds [40], that is, $\beta \approx 3$ [41]. Empirical studies show that the tails
of the return distributions for $|r| > 2$ follow Eq. (13) and the exponent $\beta$ takes values close
to 3, as illustrated in figure 6.

For a given risk level of VaR $= q < 0$, the probability $p^*$ of loss is

$$p^* = \int_{-\infty}^{q} p(r)\,dr = \frac{k}{\beta}|q|^{-\beta}, \quad q < -2. \qquad (14)$$

The mean interval $\langle\tau\rangle$ is defined as

$$\langle\tau\rangle = \frac{\text{total number of return samples}}{\text{number of return samples with } r < q}. \qquad (15)$$

By definition, the inverse of the mean recurrence interval represents the probability $p^*$ and
should follow

$$p^* = \frac{1}{\langle\tau\rangle} = \frac{k}{\beta}|q|^{-\beta}, \quad q < -2. \qquad (16)$$

Figure 7 plots $1/\langle\tau\rangle$ as a function of $|q|$ for the two Chinese indices and four
representative stocks, and the estimated exponents are very close to those of the return
distributions, which verifies the relation in Eq. (16).
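The relation in Eqs. (15) and (16) can be checked numerically with a short sketch: the loss probability is the fraction of returns below $q$, and the exponent $\beta$ is read off as the negative slope of $\log p^*$ against $\log |q|$. The function names and the plain least-squares fit are assumptions of this illustration, and every threshold must have at least one return below it for the logarithm to be defined.

```python
import math

def loss_probability(r, q):
    """Fraction of returns below the negative threshold q, i.e. 1/<tau> in Eq. (15)."""
    return sum(1 for x in r if x < q) / len(r)

def tail_exponent(r, thresholds):
    """Negative slope of log p* against log |q|; Eq. (16) predicts the value beta."""
    pts = [(math.log(abs(q)), math.log(loss_probability(r, q))) for q in thresholds]
    n = len(pts)
    mx = sum(x for x, _ in pts) / n
    my = sum(y for _, y in pts) / n
    slope = (sum((x - mx) * (y - my) for x, y in pts)
             / sum((x - mx) ** 2 for x, _ in pts))
    return -slope
```

For returns whose negative tail follows the inverse cubic law, `tail_exponent` should return a value near 3.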
Figure 6. Probability distribution $P(r)$ of the normalized return $r$ for (a) SSEC, (b) SZCI, and
(c)-(e) four representative stocks 600009, 600060, 000002 and 000063. The solid lines are
power-law fits for $P(r < 0)$.

Figure 7. Reciprocal of the mean recurrence interval $1/\langle\tau\rangle$ as a function of the absolute
threshold $|q|$ for (a) SSEC, (b) SZCI, and (c)-(e) four representative stocks 600009, 600060,
000002 and 000063. The solid lines are fitted power-law curves.



5.3. Conditional loss probability $p^*$

The conditional mean recurrence interval $\langle\tau|\tau_0\rangle$ is defined as the mean recurrence
interval conditioned on the preceding interval $\tau_0$. In figure 8 we plot the scaled conditional
mean recurrence interval $\langle\tau|\tau_0\rangle/\langle\tau\rangle$ as a function of the scaled preceding interval
$\tau_0/\langle\tau\rangle$ for the two Chinese indices and four representative stocks. We assume that
$\langle\tau|\tau_0\rangle/\langle\tau\rangle$ follows



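$$\frac{\langle\tau|\tau_0\rangle}{\langle\tau\rangle} = \left[\frac{\tau_0}{\langle\tau\rangle}\right]^{\nu}. \qquad (17)$$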



As shown in figure 8, the relationship between $\langle\tau|\tau_0\rangle/\langle\tau\rangle$ and
$\tau_0/\langle\tau\rangle$ is well described by Eq. (17) in the intermediate region
$\tau_0/\langle\tau\rangle \in (0.1, 10]$. For large and small $\tau_0/\langle\tau\rangle$, the curves of
$\langle\tau|\tau_0\rangle/\langle\tau\rangle$ for different $q$ values evidently diverge, and cannot be well
fitted by Eq. (17).
Figure 8. Scaled mean conditional recurrence interval $\langle\tau|\tau_0\rangle/\langle\tau\rangle$ as a function
of the scaled recurrence intervals $\tau_0/\langle\tau\rangle$ for (a) SSEC, (b) SZCI, and (c)-(e) four
representative stocks 600009, 600060, 000002 and 000063.



Based on the conditional mean recurrence interval, we further calculate the loss
probability conditioned on the preceding interval $\tau_0$. Similar to Eqs. (14) and (16), we expect
that

$$p^* = \frac{1}{\langle\tau|\tau_0\rangle} = \int_{-\infty}^{q} p(r|\tau_0)\,dr, \qquad (18)$$

where $p(r|\tau_0)$ is the probability that a return $r$ immediately follows the interval $\tau_0$, and
$p^*$ is the loss probability for a risk level of VaR $= q < 0$ conditioned on the preceding interval
$\tau_0$ between losses below $q$. If we know the preceding interval $\tau_0$, we can estimate the risk
level corresponding to a certain loss probability $p^*$. Substituting Eq. (16) into Eq. (17), we
obtain an expression for $1/\langle\tau|\tau_0\rangle$ that depends on $\tau_0/\langle\tau\rangle$ and $q$. In figure 9
we plot the contours of the theoretical conditional loss probability $p^* = 1/\langle\tau|\tau_0\rangle$ using
the parameters estimated from figures 7 and 8. For comparison, we also plot the contours of the
empirical conditional loss probability in figure 10. The patterns of the two contour maps are
similar except for large and small $\tau_0/\langle\tau\rangle$. The large values of the empirical loss
probability for large or small $\tau_0/\langle\tau\rangle$ indicate a high risk of extreme loss events
conditioned on a long or short elapsed time since the previous extreme event. To improve the
theoretical estimate of the conditional loss probability, the relationship between
$\langle\tau|\tau_0\rangle/\langle\tau\rangle$ and $\tau_0/\langle\tau\rangle$ in Eq. (17) needs to be refined
for large and small $\tau_0/\langle\tau\rangle$.
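An empirical counterpart of Eq. (18) can be sketched by grouping the intervals according to their predecessor and inverting the conditional mean in each group. The function name `conditional_loss_probability` and the use of equal-size bins are choices made for this illustration only.

```python
def conditional_loss_probability(intervals, n_bins=4):
    """Return a list of (mean tau0 / <tau>, 1 / <tau | tau0>) for equal-size bins of tau0."""
    mean_tau = sum(intervals) / len(intervals)
    pairs = sorted(zip(intervals, intervals[1:]), key=lambda p: p[0])  # (tau0, tau)
    n = len(pairs)
    out = []
    for k in range(n_bins):
        chunk = pairs[k * n // n_bins:(k + 1) * n // n_bins]
        tau0_scaled = sum(t0 for t0, _ in chunk) / len(chunk) / mean_tau
        cond_mean = sum(t for _, t in chunk) / len(chunk)   # <tau | tau0> in this bin
        out.append((tau0_scaled, 1.0 / cond_mean))
    return out
```

Repeating this for several thresholds $q$ would give the raw material for an empirical contour map of $p^*$ over $\tau_0/\langle\tau\rangle$ and $q$.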
Figure 9. Contour maps of theoretical loss probability $p^*$ for (a) SSEC, (b) SZCI, and (c)-(e)
four representative stocks 600009, 600060, 000002 and 000063.




6. Conclusion 



We have studied the probability distribution and the memory effect of recurrence intervals
between two consecutive returns above a positive threshold $q > 0$ or below a negative
threshold $q < 0$ for SSEC, SZCI and 20 liquid stocks of the Chinese stock markets. Since our
data sets are 1-min high-frequency returns, the statistics of our findings are much better than
previous results using daily data [17, 18, 19].

We found that the PDFs of recurrence intervals with different thresholds collapse onto a
single curve for each index and individual stock, and are symmetric with respect to positive
and negative returns. The tails of the recurrence interval distributions for different values
of the threshold $q$ follow scaling behavior, and the goodness-of-fit tests show that the scaling
function of 16 stocks and SZCI can be approximated by a power law at the significance
level of 1% using the KS, $KS_W$ and CvM statistics.

The investigation of the conditional PDF $P_q(\tau|\tau_0)$ and the detrended fluctuation function
$F(l)$ demonstrates the existence of both short-term and long-term correlations in the recurrence
intervals. To the best of our knowledge, the memory effects of recurrence intervals of financial
returns have not been studied previously, although the memory effects of volatility recurrence
intervals are well documented. The long-term correlation in the recurrence intervals is usually
attributed to the long-term correlations in the original time series. In the current case,
although the returns are uncorrelated, the positive returns are strongly correlated, and so are
the negative returns. Therefore, the long memory in the recurrence intervals can also be
attributed to the long memory in the positive and negative return series.

We also apply the recurrence interval analysis to the risk estimation for the Chinese stock
markets, and provide relatively accurate estimates of the probability $W_q(\Delta t|t)$ and the
conditional loss probability $p^*$. These results are useful for the assessment and management
of risks in financial engineering. We submit that the recurrence interval analysis of financial
returns pioneered in Refs. [17, 18, 19, 20] has the potential to forge a link between
econophysics and financial engineering, and more work should be done in this direction.